Developing Intonation Corpora for isiXhosa and isiZulu
نویسندگان
چکیده
In order to bring the tools of statistical pattern recognition to bear on intonation modelling, we need tailor-made corpora in the languages of interest. We describe how two such corpora were developed (for isiZulu and isiXhosa, respectively). We also show how those corpora can be used without further interpretation to gain insight into matters such as overall pitch contours and gender differences, and discuss the additional steps that will be required to create truly generative models from these corpora.
منابع مشابه
Pitch modelling for the Nguni languages
Although the complexity of prosody is widely recognised, the lack of widely-accepted descriptive standards for prosodic phenomena has meant that prosodic systems for most of the languages of the world have, at best, been described in impressionistic rule-based terms. For the languages of Southern Africa, the deficiencies in our modelling capabilities are acute. Little work of a quantitative nat...
متن کاملToward a knowledge-to-text controlled natural language of isiZulu
The language isiZulu belongs to the Nguni group of languages, which also include isiXhosa, isiNdebele and siSwati. Of the four Nguni languages, isiZulu is the most dominant language in South Africa, which is spoken by 22.7% of the country’s 51.8 million population. However, isiZulu (and even more so the other Nguni languages) still remains an under-resourced language for software applications. ...
متن کاملAutomatic intonation modeling with INTSINT
Accurate intonation modeling has become a vital part of modern day speech synthesis systems. This is especially true for tonal languages such as isiZulu, where the intonation of an utterance not only influences the perceived naturalness of the synthetic voice, but may also influence its semantics. In this work we explore the INTSINT intonation modeling algorithm and its application to an isiZul...
متن کاملPart-of-Speech Tagging and Chunking in Text-to-Speech Synthesis for South African Languages
Text-to-speech synthesis can be an empowering communication tool in the hands of the print-disabled or augmentative and alternative communication user. In an effort to improve the naturalness of synthesised speech – and thus enhance the communication experience – we apply the natural language processing tasks of part-of-speech tagging and chunking to the text in the synthesis process. We cover ...
متن کاملPhonetics of intonation in South African Bantu languages
Much is already known about the prosodic systems of the indigenous South African languages from descriptions and analyses in the existing literature. All of the existing work has been carried out in the field of African studies or formal linguistics. In order to be able to implement the generalisations obtained into computational models in speech processing, the existing sources and results mus...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005